Goto

Collaborating Authors

 search strategy



Construction of Hierarchical Neural Architecture Search Spaces based on Context-free Grammars

Neural Information Processing Systems

The discovery of neural architectures from simple building blocks is a long-standing goal of Neural Architecture Search (NAS). Hierarchical search spaces are a promising step towards this goal but lack a unifying search space design framework and typically only search over some limited aspect of architectures. In this work, we introduce a unifying search space design framework based on context-free grammars that can naturally and compactly generate expressive hierarchical search spaces that are 100s of orders of magnitude larger than common spaces from the literature. By enhancing and using their properties, we effectively enable search over the complete architecture and can foster regularity. Further, we propose an efficient hierarchical kernel design for a Bayesian Optimization search strategy to efficiently search over such huge spaces. We demonstrate the versatility of our search space design framework and show that our search strategy can be superior to existing NAS approaches.


EnCompass: Enhancing Agent Programming with Search Over Program Execution Paths

Li, Zhening, Solar-Lezama, Armando, Yue, Yisong, Zheng, Stephan

arXiv.org Artificial Intelligence

We introduce a new approach to agent programming, the development of LLM-based agents. Current approaches to agent programming often entangle two aspects of agent design: the core workflow logic and the inference-time strategy (e.g., tree search). We introduce "probabilistic angelic nondeterminism" ("PAN"), a programming model that disentangles these two concerns, allowing the programmer to describe the agent workflow and independently experiment with different inference-time strategies by simply changing a few inputs. We provide an implementation of PAN in Python as the EnCompass framework, which uses a Python decorator to compile agent workflow programs into a search space. We present three case studies that demonstrate how the framework lets the programmer quickly improve the reliability of an agent and easily switch between different inference-time strategies, all with little additional coding.








Searching Neural Architectures for Sensor Nodes on IoT Gateways

Garavagno, Andrea Mattia, Ragusa, Edoardo, Frisoli, Antonio, Gastaldo, Paolo

arXiv.org Artificial Intelligence

Abstract--This paper presents an automatic method for the design of Neural Networks (NNs) at the edge, enabling Machine Learning (ML) access even in privacy-sensitive Internet of Things (IoT) applications. The proposed method runs on IoT gateways and designs NNs for connected sensor nodes without sharing the collected data outside the local network, keeping the data in the site of collection. This approach has the potential to enable ML for Healthcare Internet of Things (HIoT) and Industrial Internet of Things (IIoT), designing hardware-friendly and custom NNs at the edge for personalized healthcare and advanced industrial services such as quality control, predictive maintenance, or fault diagnosis. By preventing data from being disclosed to cloud services, this method safeguards sensitive information, including industrial secrets and personal data. The outcomes of a thorough experimental session confirm that -on the Visual Wake Words dataset-the proposed approach can achieve state-of-the-art results by exploiting a search procedure that runs in less than 10 hours on the Raspberry Pi Zero 2. Index T erms--Neural Architecture Search, Edge AI, Healthcare Internet of Things, Industrial Internet of Things. Neural Networks (NNs) are widely used in Internet of Things (IoT) applications [1]. In this context, often the data collected by the available sensors are added to the training set with the purpose of improving generalization performances. On the other hand, in some cases, the data can be sensitive; healthcare data [2], industrial data [3] and biometric data [4] provide possible examples. Privacy concerns prevent some entities from accessing the benefits of machine learning (ML), as they may be unable or unwilling to share their data with cloud services that can train or even automatically design a custom neural network (NN) [5].